Identification of Patients with Family History of Pancreatic Cancer - Investigation of an NLP System Portability

نویسندگان

  • Saeed Mehrabi
  • Anand Krishnan
  • Alexandra M. Roch
  • Heidi Schmidt
  • Dingcheng Li
  • Joe Kesterson
  • Chris Beesley
  • Paul R. Dexter
  • C. Max Schmidt
  • Mathew J. Palakal
  • Hongfang Liu
چکیده

In this study we have developed a rule-based natural language processing (NLP) system to identify patients with family history of pancreatic cancer. The algorithm was developed in a Unstructured Information Management Architecture (UIMA) framework and consisted of section segmentation, relation discovery, and negation detection. The system was evaluated on data from two institutions. The family history identification precision was consistent across the institutions shifting from 88.9% on Indiana University (IU) dataset to 87.8% on Mayo Clinic dataset. Customizing the algorithm on the the Mayo Clinic data, increased its precision to 88.1%. The family member relation discovery achieved precision, recall, and F-measure of 75.3%, 91.6% and 82.6% respectively. Negation detection resulted in precision of 99.1%. The results show that rule-based NLP approaches for specific information extraction tasks are portable across institutions; however customization of the algorithm on the new dataset improves its performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Thoracoscopic Splanchnicectomy for Pain Control in Irresectable Pancreatic Cancer

Introduction : Severepain is a major problem in patients with unresectable pancreatic cancer. The goal of this study is to evaluate the effects of Thoracoscopic Splanchnicectomy (TS) on pain control in these patients suffering from unresectable pancreatic cancer. Methods:Between years 2000 to 2011, 20 patients suffering from unresectable pancreatic cancer underwent TS due to severe pain. They w...

متن کامل

Health education models application by peer group for improving breast cancer screening among Iranian women with a family history of breast cancer: A randomized control trial

    Background: Studies have shown that participation of Iranian women with family history of breast cancer in screening service is low. This investigation has evaluated the effectiveness of health models according to peer group in improving clinical breast exam (CBE) among Iranian women with a family history of breast cancer.    Methods: This was a randomized control ...

متن کامل

The impact of BMI, Smoking, Family History and Ala 119 Ser (rs1056827) Polymorphism of CYP1B1*2 Genes with Susceptibility to Prostate Cancer among Iranian Men

Background and Aims: The genes involved in detoxification and the elimination of toxic metabolites have a vital role in cancer pathogenesis. Also, there is evidence that higher amounts of body fat are associated with increased risks of several cancers. The current study aims to identify the relationship of age, body mass index (BMI), smoking, family history, and polymorphism rs1056827 of CYP1B1...

متن کامل

Comparing methods for identifying pancreatic cancer patients using electronic data sources.

We sought to determine the accuracy of two electronic methods of identifying pancreatic cancer in a cohort of pancreatic cyst patients, and to examine the reasons for identification failure. We used the International Classification of Diseases, 9(th) Edition (ICD-9) codes and natural language processing (NLP) technology to identify pancreatic cancer in these patients. We compared both methods t...

متن کامل

Investigation of HER-2 expression and its Correlation with clinicopathological parameters and overall survival of esophageal squamous cell carcinoma patients

Background & Objective: Human epidermal growth factor receptor 2 (HER-2) exhibits a vast range of expression in esophageal squamous...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Studies in health technology and informatics

دوره 216  شماره 

صفحات  -

تاریخ انتشار 2015